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ADDER WITH IMPROVED OVERFLOW FLAG GENERATION 
Related Applications 

i^v\ T^re-ptesent application is related to U.S. Patent Application Serial No. 09/291,677 filed 
April 14, 1999 in theitajn^f inventors M. Besz et al. and entitled "Prefix Tree Adder with Efficient 
Carry Generation," U.S. Patent Application Serial No. 09/525,644 filed March 1 5, 2000 in the name 
of inventors A. Goldovsky et al. and entitk4"Prefix Tree Adder with Efficient Sum Generation," 
and U.S. Patent Application Serial No. 09/5 69,0^^^fi4ed May 1 1, 2000 in the name of inventors A. 
Goldovsky et al. and entitled "Incorporation of Split- Addert^^i^wi thin a Carry-Skip Adder without 
Additional Propagation Delay," all of which are incorporated by referfeH^herein. 

Field of the Invention 

The present invention relates generally to adder circuits for use in semiconductor integrated 
circuits and other electronic devices, and more particularly to techniques for generating overflow 
flags in such adder circuits. 

Background of the Invention 

In a conventional n-bit prefix tree adder, the addition of two numbers A and 5, 



n-2 

J 



y=o 



represented in two's complement binary form, can be accomplished by computing: 



A. Goldovsky 9 



gj = Ojbj 

Pj = ® = Sjtj 

Sj = Pj ® Cj_^ 



> vy 0< j<n 



where c i is the primary carry-input signal. The signals designated gj, tj,pj and c, are referred to as 
generate, transmit, propagate and carry signals, respectively. The resulting sum of A and B is 



H j=o 
W 

w 

ffi An overflow occurs, and the resulting sum is invalid, if 

m 
m 

L OVF=c„_,®c^_, = l , 



no where OVF is an overflow flag. 

FIG. 1 shows an example of a logic circuit 1 00 that may be used to generate the overflow flag 
OVF in a conventional «-bit prefix tree adder of the type described above. The circuit 100 thus 
represents a portion of the conventional adder, and in this example includes a two-input OR gate 
102, a two-input AND gate 104, a first two-input exclusive-or (XOR) gate 106, and a second two- 

15 input XOR gate 108. The primary carry-output signal c^., is generated as an output of the OR gate 
102 and is applied as an input to the XOR gate 108. The carry signal c„.2 is applied as an input to 
both the XOR gate 106 and the XOR gate 108. The XOR gate 106 generates the sum signal as 
the exclusive-or of the carry signal c„.2 and the propagate signal The sum signal s^_^ is the final 
sum bit of the n-bit adder. The XOR gate 108 generates the overflow flag OVF as the exclusive-or 
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of the carry signals and c„,2- It is important to note that in this conventional circuit, the overflow 
flag OVF is generated after the addition is completed. Unfortunately, this results in an excessive 
amount of computational delay. 

A gate level model may be used to quantify the computational delay of the FIG. 1 logic 
circuit 100. In accordance with such a model, a 2-input gate such as a NAND or NOR gate may be 
specified as having a delay of 5, while XOR/XNOR, AOI (and-or-invert), OAI (or-and-invert) and 
2-to-l multiplexer gates each have a delay of 1.55. Using this model, the computational delay 
associated with generation of the overflow flag OVF in the logic circuit 1 00 is 35, corresponding to 
an adder delay of 1.55 plus a delay of 1.56 for the XOR gate 108, as is illustrated in the figure. 

Improved adders which can provide significant reductions in logic depth, computational 
delay and circuit area relative to conventional adders are disclosed in the above-cited U.S. Patent 
Applications Serial No. 09/291,677, Serial No. 09/525,644 and Serial No. 09/569,022. Although 
these improved adders provide substantial advantages over conventional adders, a need nonetheless 
remains for further improvements, particularly in terms of reducing the above-described 
computational delay associated with overflow flag generation. 

Summary of the Invention 

The invention provides improved techniques for generation of an overflow flag in a prefix 
tree adder or other type of adder circuit. In accordance with one aspect of the invention, an overflow 
flag is generated in parallel with generation of at least one of a sum signal and a primary carry-output 
signal of the adder. As a result, the computational delay associated with overflow flag generation 
is substantially reduced. Advantageously, this improvement is provided without requiring any 
increase in the transistor count or circuit area of the adder. 

In an illustrative embodiment of the invention, an w-bit adder includes a number of 
computational stages each associated with one or more bit positions. Particular ones of the 
computational stages generate a sum output signal in the form of a final sum bit s„_^ of the n-hit adder 
and a primary carry-output signal of the w-bit adder. A flag generation circuit is coupled to at 
least one of the stages and is operative to generate an overflow flag for the adder substantially in 
parallel with the generation of the sum output signal and the primary carry-output signal c„., . The 
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flag generation circuit is configured such that it does not require the primary carry-output signal 
to generate the overflow flag for the w-bit adder. 

In accordance with another aspect of the invention, the flag generation circuit for an «-bit 
adder may generate an overflow flag OFF as: 



where c„.2 is an n-2 carry signal of the adder, is an n-\ transmit signal of the adder, and g„_^ is an 
w-1 generate signal of the adder. As a result, the generation of the overflow flag OVF does not 
require the use of a primary carry-output signal c^.j of the adder. 

In accordance with a further aspect of the invention, the flag generation circuit may include 
a multiplexer which selects one of a plurality of input signals for propagation to its output as the 
overflow flag based at least in part on a signal associated with the signal line of at least one of the 
computational stages. For example, the overflow flag OVF described above can be generated using 
a 2-to-l multiplexer which receives as a first input a transmit signal receives as a second input 
a generate signal g„_i , and receives as a select signal a carry signal c„.2. Depending on the value of 

the select signal, the multiplexer selects either the transmit signal r^., or the generate signal g„_i for 

propagation in inverted form to its output as the overflow flag. 

The efficient overflow flag generation techniques of the present invention provide a 
substantially reduced computational delay relative to the above-described conventional adder 
architectures. The techniques of the invention are applicable to a wide variety of adders, including 
prefix tree adders, carry-lookahead adders, carry-skip adders, carry-ripple adders, carry-save adders 
as well as other types of adders, and to both radix-2 and non-radix-2 implementations of such adders. 
These and other features and advantages of the present invention will become more apparent from 
the accompanying drawings and the following detailed description. 
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Brief Description of the Drawings 

FIG. 1 is a schematic diagram of a conventional logic circuit for computing an overflow flag 
in an «-bit prefix tree adder. 

FIG. 2 is a schematic diagram of a logic circuit in accordance with an illustrative 
embodiment of the invention for computing an overflow flag in an n-bit prefix tree adder or other 
type of adder. 

FIG. 3 shows an example of an /z-bit prefix tree adder in which the logic circuit of FIG. 2 
may be implemented. 

Detailed Description of the Invention 

The present invention will be illustrated below in conjunction with an example prefix tree 
adder. It should be understood, however, that the invention is not limited to use with any particular 
type of adder, but is instead more generally applicable to any type of adder in which it is desirable 
to significantly improve the computational delay associated with generation of an overflow flag 
without requiring an increase in transistor count or area for the adder circuit, and thus without 
substantially increasing the cost or complexity of the adder circuit. For example, although illustrated 
using a radix-2 prefix tree adder, it will be apparent to those skilled in the art that the disclosed 
techniques are readily applicable to both radix-2 and non-radix-2 adders, as well as to other types 
of adders, such as carry-lookahead adders, carry-skip adders, carry-ripple adders and carry-save 
adders. 

FIG. 2 shows an example of a logic circuit 200 in accordance with an illustrative embodiment 
of the present invention. The logic circuit 200 may be used to generate the overflow flag OVF in 
a conventional n-bit prefix tree adder of the type described above, or in any of a number of other 
types of adders. The circuit 200 thus represents a portion of such an adder, and in this example 
includes a two-input OR gate 202, a two-input AND gate 204, a two-input exclusive-or (XOR) gate 
206, and a 2-to-l inverting multiplexer 208. The primary carry-output signal c„,^ is generated as an 
output of the OR gate 202, as in the conventional logic circuit 100 of FIG. 1. In addition, the carry 
signal c„.2 and the propagate signal p„,^ are applied as inputs to the XOR gate 206, which generates 
the sum signal as the exclusive-or of the carry signal c„.2 and the propagate signal also as in 
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the conventional logic circuit 100 of FIG. 1. The sum signal s„_^ is the final sum bit of the 
corresponding n-bit adder. 

In accordance with the invention, the overflow flag OVF is generated in parallel with 
generation of the sum signal s„_^ and the primary carry-output signal c„.,, using 2-to-l inverting 
multiplexer 208 as shown in FIG. 2. More specifically, the following formulation is used to define 
the overflow flag OVF in the illustrative embodiment: 

OVF = c„_, ® c„_, = c„_, 9 (g„_, + /„.,c„_,), 
which can be simplified as follows: 

Based on this simplification, the overflow flag OVF does not require the use of the primary carry- 
output signal c„.i, and thus can be generated in parallel with the output sum signal s„_^ and the 
primary carry-output signal c„.i, as is shown in FIG. 2. 

The 2-to-l inverting multiplexer 208 receives as a first input the transmit signal t„_^ and as 
a second input the generate signal . The carry signal c„.2 is also applied to the multiplexer 208 

as a select signal. Depending on the value of the select signal, the multiplexer 208 selects either the 
transmit signal or the generate signal for propagation in inverted form to its output as the 

overflow flag OVF. More particularly, if the carry signal c„,2 has a logic "one" value, the transmit 
signal is propagated in inverted form to the output of the multiplexer 208 as the overflow flag 
OVF, and if the carry signal c„_2 has a logic "zero" value, the generate signal g„^^ is propagated in 

inverted form to the output of the multiplexer 208 as the overflow flag OVF, The multiplexer 208 
thus implements the simplified equafion given above for the overflow flag OVF. Although the 
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multiplexer 208 in this embodiment is an inverting multiplexer, the invention can be implemented 
using other types of circuitry. 

Using the previously-described gate level model, the computation delay associated with 
generation of the overflow flag OVF in the circuit 200 of FIG. 2 is given by the adder delay 1.55, 
5 as is indicated in the figure. The additional 1.55 delay associated with the XOR gate 108 in the 
conventional logic circuit 100 of FIG. 1 is eliminated in this illustrative embodiment of the 
invention. 

Advantageously, the reduction in the computation time required to generate the overflow flag 
OVF in the illustrative embodiment is achieved without requiring any significant increase in the 
10 transistor count or circuit area of the adder, and thus without increasing adder cost or complexity. 

It should be emphasized that the logic circuitry in FIG. 2 is shown by way of example only. 
Those skilled in the art will recognize that numerous altemative arrangements of logic circuitry may 
be used to reduce the computational delay associated with overflow flag generation in accordance 
with the techniques of the present invention. 
^15 As noted previously, the present invention may be implemented in a number of different 

ff's types of adders. One such adder will now be described in greater detail in conjunction with FIG. 3. 

FIG. 3 shows a set of superimposed prefix trees 300 for an n-bit prefix tree adder of the type 
described in the above-cited U.S. Patent Application Serial No. 09/291 ,677, and in which the logic 
^ circuit 200 of FIG. 2 may be implemented. The general algorithm for an n-bit radix-2 prefix tree 
g20 adder of this type is described below. 

Step 1 (1 stage): 
Calculate 



y 



tj=aj^bj 



' Mj 0 < j <n 



7 



A. Goldovsky 9 

Step 2 ([logjw] stages): 

For /r = 1 ... riog2«l calculate 

•^j - GU-.< + ':-2'-.s-2- v-'' 2'" - 1 s ^- < 2'-'. 

(g;:,.*,.'::,. J = (G;.,.-..,.j:i,.-,,,)''(G;:£,j;^/.':i) 2*-' ^ y < « • 



3 (1 stage) 
Calculate 

and 

In the set of prefix trees of FIG. 3, the squares at the top of the figure compute gp tj and pj for each 
bit position in accordance with Step 1 . The empty circles apply the fundamental carry operator in 
accordance with Step 2. The filled circles represent buffers. The crossed circles compute carries in 
accordance with Step 2 and Step 3 above. The diamonds at the bottom of the figure generate the sum 
at each bit position from the pj signal in accordance with the equation of Step 3. It should be noted 
that the sum computation of in Step 3 occurs in parallel with the computation of the final carry 
output c^.i in Step 3. 

In the above description, (G/, Zy) = (gp tj), and 

(G/,T]')=(gj,tj)o(gj.,Jj_,)o,,,o{g,,0 if j>U 
where o is the fundamental carry operator. The carry Cj for each bit position is then given by 
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c. = Gi + Vc_, 

where c , is the primary carry input. If there is no primary carry input, then Cj is simply Gj, 

The logic depth of an w-bit prefix tree adder configured as shown in FIG. 3 is 2 + [1082^15 and 
5 the fanout of the carry input c , is 1 + pogjw]. The above-described algorithm can also be extended 
in a straightforward manner to higher radix prefix trees. 

Although static circuits are used in the prefix-tree adder 300 of FIG. 3, it should be noted that 
the invention may be implemented in an adder circuit which includes either static circuits, dynamic 
circuits or combinations of both static and dynamic circuits. Static circuits are often preferred to 
1 0 dynamic circuits because of their ease of design. 
Q The above-described illustrative embodiment of the invention may be configured to meet the 

^ requirements of a variety of different circuit applications, and may be implemented in adder circuits 

^ using any desired value of n. Moreover, as previously noted, a variety of other types of adders, 

W 

01 including non-radix-2 adders, may also be implemented using the techniques of the present 
^15 invention. 

^ Adders in accordance with the invention may be used as elements of many different types 

nj of circuits, such as, e.g., arithmetic logic units (ALUs), multiply-add units, and comparators. The 

O 

^ invention can be incorporated in a wide variety of integrated circuits or other processmg devices, 

2 including, e.g., microprocessors, digital signal processors (DSPs), microcontrollers, application- 

U 

20 specific integrated circuits (ASICs), memory circuits, telecommunications hardware and other types 
of processing devices. 

As a more particular example, the overflow flag generation techniques of the invention can 
be implemented in a straightforward manner in a DSP such as the DSP 16000 fi-om the 
Microelectronics Group of Lucent Technologies Inc., Allentown, PA, as described in DSP16000 
25 Digital Signal Processor Core, Information Manual, Lucent Technologies, July 1998, which is 
incorporated by reference herein. In the DSP 1 6000, two modules which can be configured to utilize 
the overflow flag generation techniques of the invention are the ADDSUB and ALU modules, both 
described in the above-cited Information Manual. 
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Numerous alternative embodiments of the present invention within the scope of the following 
claims will be readily apparent to those skilled in the art. 
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